ABSTRACT


Nonparametric  estimation  of  the  hazard  rate  of  a 
lifetime  distribution  based  on  right-censored  data  is  discussed. 
The  methods  considered  include  maximum  likelihood,  kernel-type, 
Bayesian, and histogram estimators. Recent results for kernel-type
estimators are presented and compared.


1.  INTRODUCTION 


Nonparametric estimation of the hazard rate, or failure rate, is a
frequent topic of investigation in the statistical literature because of
its practical importance. Until quite recently, hazard rate estimation
had been based on complete samples of n independent identically
distributed lifetimes. However, observations may be censored or truncated
in many life testing situations (for example, see Lagakos, 1979). This
occurs often in medical trials when the patients may enter treatment at
different times and then either die from the disease under investigation
or leave the study before its conclusion. A similar situation may occur
in industrial life testing when items are removed from the test at random
times for various reasons. It is of interest to be able to estimate
nonparametrically the unknown hazard rate of the lifetime random variable
from this type of data without ignoring or discarding the right-censored
information.

The purpose of this paper is to discuss nonparametric estimation of
the hazard rate function for right-censored samples. The various types of
estimators that have been proposed in the literature will be indicated and
briefly discussed in Section 3. They include maximum likelihood estimators,
kernel-type estimators, Bayesian estimators, and histogram estimators.
Due to their computational simplicity and other properties, the kernel-type
hazard rate estimators will be emphasized. Results of Tanner (1983) and
Tanner and Wong (1983, 1984) will be presented in Section 4, while the
estimator considered by McNichols and Padgett (1981) will be discussed in
Section 5.


2.  NOTATION  AND  PRELIMINARIES 


Let $X_1^0,\dots,X_n^0$ be the survival times of n items or individuals
which are censored from the right by a sequence of random variables
$U_1,\dots,U_n$. Let $F^0$ denote the common unknown absolutely continuous
distribution function of the $X_i^0$'s, which are assumed to be independent
and nonnegative, and let $f^0$ be a corresponding density function. The
observed data will be denoted by the pairs $(X_i,\Delta_i)$, $i=1,\dots,n$,
where

$$X_i = \min\{X_i^0, U_i\}, \qquad
\Delta_i = \begin{cases} 1 & \text{if } X_i^0 \le U_i, \\ 0 & \text{if } X_i^0 > U_i. \end{cases}$$


Thus, it is known whether a particular observation is a time of failure (or
death) or a loss time (censored time). The nature of the $U_i$'s determines
the type of censoring. (i) If $U_1,\dots,U_n$ are fixed constants, the
observations are time truncated. If all $U_i$'s are equal to the same
constant, then the case of Type I censoring results. (ii) If all
$U_i = X^0_{(r)}$, the $r$th order statistic of $X_1^0,\dots,X_n^0$, then the
situation is that of Type II censoring. (iii) If $U_1,\dots,U_n$ constitute a
random sample from a distribution $H$ (which is usually unknown) and are
independent of $X_1^0,\dots,X_n^0$, then $(X_i,\Delta_i)$, $i=1,\dots,n$, is
called a randomly censored sample. Assuming (iii), $\Delta_1,\dots,\Delta_n$
are independent Bernoulli random variables and the distribution function $F$
of each $X_i$, $i=1,\dots,n$, is given by $1-F = (1-F^0)(1-H)$. If it is
assumed that there is a positive constant $\beta$ such that
$1-H = (1-F^0)^{\beta}$, then this is called the Koziol-Green (1976) model of
random censorship (which is the proportional hazards assumption of Cox,
1972). Chen, Hollander, and Langberg (1982) indicated that the pairs
$(X_i^0,U_i)$, $i=1,\dots,n$, follow this proportional hazards model if and
only if $(X_1,\dots,X_n)$ and $(\Delta_1,\dots,\Delta_n)$ are independent.
This Koziol-Green model of random censorship arises in several situations
(Efron, 1967; Csörgő and Horváth, 1981; Chen, Hollander and Langberg, 1982)
and will be referred to in Section 5. In practice, however, this model may be
somewhat restrictive.

Next, let $Z_1,\dots,Z_n$ denote the ordered $X_1,\dots,X_n$ with
corresponding $\Delta$-values $\Delta'_1,\dots,\Delta'_n$. The Kaplan-Meier
(1958) estimator $\hat P_n(x)$, or product-limit estimator (Efron, 1967), of
the survival probability $1-F^0(x)$ at $x \ge 0$ is given by

$$\hat P_n(x) = \begin{cases}
1, & 0 \le x \le Z_1, \\
\prod_{i=1}^{k-1} \left(\dfrac{n-i}{n-i+1}\right)^{\Delta'_i}, & x \in (Z_{k-1}, Z_k],\ k=2,\dots,n, \\
0, & x > Z_n.
\end{cases}$$

Also, $\hat F_n(x) = 1 - \hat P_n(x)$ will denote the product-limit estimator
of $F^0(x)$.
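
As an illustration only, the piecewise definition of the product-limit
estimator above can be sketched in a few lines of Python. The function name
`product_limit_survival` and the use of NumPy are assumptions made for this
sketch and are not part of the original paper; ties among the observations
are not treated specially.

```python
import numpy as np


def product_limit_survival(x, times, deltas):
    """Evaluate the product-limit estimate of the survival probability at x.

    times  : observed values X_i = min(X_i^0, U_i)
    deltas : indicators, 1 for an observed failure, 0 for a censored time
    (Hypothetical helper written for illustration.)
    """
    order = np.argsort(times, kind="stable")
    z = np.asarray(times, dtype=float)[order]   # ordered observations Z_1..Z_n
    d = np.asarray(deltas, dtype=int)[order]    # matching indicators
    n = len(z)
    if x > z[-1]:
        return 0.0
    # Number of ordered observations strictly below x; this equals k - 1
    # when x lies in (Z_{k-1}, Z_k], and 0 when 0 <= x <= Z_1.
    m = int(np.searchsorted(z, x, side="left"))
    i = np.arange(1, m + 1)
    factors = ((n - i) / (n - i + 1.0)) ** d[:m]
    return float(np.prod(factors))  # empty product is 1
```

For example, with observed times (3, 1, 2, 5, 4) and indicators
(1, 1, 0, 1, 0), the estimate stays at 1 up to the smallest observation,
drops to 4/5 just after it, and is 0 beyond the largest observation.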


3.  BRIEF  REVIEW  OF  THE  LITERATURE 

The hazard rate function of the lifetime distribution $F^0$ is defined
by $r^0(x) = f^0(x)/[1-F^0(x)]$, $x \ge 0$. Several authors have considered the
nonparametric maximum likelihood estimation (MLE) of $r^0(x)$ assuming the
random censorship model. Padgett and Wei (1980) obtained the MLE of $r^0$ in
the class of increasing failure rate (IFR) distributions, while Mykytyn and
Santner (1981) derived the MLE of $r^0$ assuming either an IFR, DFR, or
U-shaped failure rate.

Bartoszynski,  Brown,  McBride  and  Thompson  (1981)  investigated  the  problem 
of estimating the intensity, or rate, function of a nonstationary Poisson
process arising in cancer studies under a type of right censorship. They
considered  a  kernel  estimator,  a  constrained  maximum  likelihood  procedure, 
and  a  discrete  maximum  penalized  likelihood  approach. 

Kernel-type estimators have been perhaps the most popular estimators
in practice due to their relative computational simplicity, smoothness, and
other properties. Blum and Susarla (1980) were the first to consider a
kernel-type estimator of $r^0$ assuming randomly right-censored data. Some
other kernel-type estimators for this case were obtained by Foldes, Rejto and
Winter (1981), McNichols and Padgett (1981), Tanner (1983), Tanner and Wong
(1983, 1984), and Schafer (1985). Several of these estimators will be
discussed in more detail in Sections 4 and 5.

Burke (1981) considered a competing risk model (see also Burke and
Horvath, 1982) in which the hazard rate estimator for the jth risk was either
a kernel-type estimator or was of the form
$r_{jn}(x) = f_{jn}(x)/(1-F_n(x))$, where $f_{jn}(x)$ was a kernel density
estimator and $F_n$ was the empirical distribution function of
$X_1,\dots,X_n$.

Yandell (1983) obtained general delta sequence curve estimators and
generalized the results of Watson and Leadbetter (1964) and Rice and Rosenblatt
(1976) to the arbitrarily right-censored case. Using the martingale technique
developed by Aalen (1978), Ramlau-Hansen (1983) considered the general
multiplicative intensity model which included the kernel-type estimators of
Foldes, Rejto and Winter (1981) and Yandell (1983).

Bayesian nonparametric estimators of the hazard rate function of the
lifetime distribution have been obtained by Padgett and Wei (1981) and Dykstra
and Laud (1981) for right-censored samples. Padgett and Wei (1981) used pure
jump processes as prior distributions on the hazard rate function, which was
assumed to be increasing, while Dykstra and Laud (1981) considered an
extended gamma process as a prior. Lo (1978) also used a Bayesian nonparametric
approach by constructing a prior random density as a convolution of a kernel
function with the Dirichlet random probability. This technique could be
applied to complete or censored samples.

In a recent paper Liu and Van Ryzin (1985) obtained a histogram
estimator of the hazard rate function from randomly right-censored data and gave
an efficiency comparison of their estimator with the kernel estimator of the
hazard rate. Also, Liu and Van Ryzin (1984) gave the large sample theory for the
normalized maximal deviation of a hazard rate estimator under random censoring
which was based on a histogram estimate of the subsurvival density of the
uncensored observations.


4.  KERNEL-TYPE  ESTIMATORS


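Tanner and Wong (1983) considered the kernel-type estimator of $r^0$ given
by

$$\hat r_n(x) = h^{-1} \sum_{j=1}^{n} (n-j+1)^{-1} \Delta'_j\, K\!\left(\frac{x-Z_j}{h}\right),$$

where $K$ was a symmetric nonnegative kernel, $K(t) = o(t^{-1})$ as
$t \to \infty$, and $\int_{-\infty}^{\infty} K(t)\,dt = 1$. The failure rate
$r^0$ was assumed to be continuous and $x$ satisfied $0 < F(x) < 1$. The
following results for the mean and variance of $\hat r_n(x)$ were obtained,
where $F^n(y)$ denotes $[F(y)]^n$:

$$E[\hat r_n(x)] = h^{-1} \int \left(1-F^n(y)\right) r^0(y)\, K\!\left(\frac{x-y}{h}\right) dy,$$

$$\mathrm{Var}[\hat r_n(x)] = h^{-2} \int I_n(F(y))\, r^0(y)\, K^2\!\left(\frac{x-y}{h}\right) dy
+ 2h^{-2} \iint_{y \le z} \left\{ F^n(z) - F^n(y)F^n(z)
- \frac{1-F(y)}{F(z)-F(y)} \left[F^n(z)-F^n(y)\right] \right\}
r^0(y)\, r^0(z)\, K\!\left(\frac{x-y}{h}\right) K\!\left(\frac{x-z}{h}\right) dy\, dz,$$

where

$$I_n(F) = \sum_{k=0}^{n-1} (n-k)^{-1} \binom{n}{k} F^k (1-F)^{n-k}.$$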
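
The kernel-type estimator defined at the start of this section is a weighted
sum of kernels centered at the ordered observations, with weight
$\Delta'_j/(n-j+1)$ at $Z_j$. A minimal Python sketch follows, assuming an
Epanechnikov kernel (one admissible choice: symmetric, nonnegative, integrating
to one, with compact support). The names `epanechnikov` and `kernel_hazard`
are hypothetical and chosen only for this illustration.

```python
import numpy as np


def epanechnikov(t):
    """Symmetric nonnegative kernel on [-1, 1] that integrates to one."""
    t = np.asarray(t, dtype=float)
    return np.where(np.abs(t) <= 1.0, 0.75 * (1.0 - t ** 2), 0.0)


def kernel_hazard(x, times, deltas, h, kernel=epanechnikov):
    """Fixed-bandwidth kernel-type hazard estimate at the point x.

    times  : observed values X_i = min(X_i^0, U_i)
    deltas : 1 for an observed failure, 0 for a censored time
    h      : bandwidth (positive)
    """
    order = np.argsort(times, kind="stable")
    z = np.asarray(times, dtype=float)[order]    # Z_1 <= ... <= Z_n
    d = np.asarray(deltas, dtype=float)[order]   # Delta'_1, ..., Delta'_n
    n = len(z)
    j = np.arange(1, n + 1)
    weights = d / (n - j + 1.0)                  # Delta'_j / (n - j + 1)
    return float(np.sum(weights * kernel((x - z) / h)) / h)
```

Censored observations receive zero weight but still affect the estimate
through the ranks: each uncensored $Z_j$ is weighted by the reciprocal of the
number of items still under observation at that time.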

In  order  to  investigate  the  limiting  behavior  of  these  expressions  the 
concept  of  compatibility  was  defined  and  is  given  below. 

Definition. $K$ is said to be compatible with a cdf $F$ if for any $M > 0$,
there exists $h$ small enough such that $h^{-1}K(h^{-1}(y-x))/(1-F(y))$ is
uniformly bounded for $|y-x| > M$.

Suppose the $h_n$'s satisfy $h_n \to 0$ and $nh_n \to \infty$. Tanner and
Wong (1983) then showed that $\hat r_n(x)$ was an asymptotically unbiased
estimator of $r^0(x)$ whenever $K$ was compatible with $F^0$. Furthermore, if
in addition $K$ was also compatible with $H$, then

$$\mathrm{Var}(\hat r_n(x)) = (nh)^{-1} \left(\int K^2(t)\,dt\right) r^0(x)\,(1-F(x))^{-1} + o((nh)^{-1}).$$

Thus, $\hat r_n(x)$ was a mean-square consistent estimator of $r^0(x)$
provided the aforementioned compatibility conditions held. In addition, by
using Hajek's projection method, Tanner and Wong (1983) proved the asymptotic
normality of $\hat r_n(x)$ (assuming $K$ was compatible with $F^0$ and $H$).

Tanner and Wong (1984) also studied a class of estimators of the same
general form as $\hat r_n(x)$ with $K(y/h)$ replaced by $K(y/\theta)$, where
$\theta$ was a positive-valued "smoothing vector" chosen to maximize a
likelihood function. Hence, for this estimator the smoothing parameters were
chosen based on the observed data. Due to their complexity, these data-based
estimators were studied by computer simulations.

Tanner (1983) considered a modified kernel-type estimator of the form

$$\hat r_n(x) = (2R_k)^{-1} \sum_{j=1}^{n} (n-j+1)^{-1} \Delta'_j\, K\!\left(\frac{x-Z_j}{2R_k}\right),$$

where $R_k$ was the distance from $x$ to the $k$th nearest of the uncensored
observations among $X_1,\dots,X_n$. This estimator allowed the data to play a
role in determining the degree of smoothing that would occur in the estimate.
Assuming that $F^0$ and $f^0$ were continuous in a neighborhood about $x$,
$k = [n^{\alpha}]$, $1/2 < \alpha < 1$, where $[\cdot]$ was the greatest
integer function, that $K$ had bounded variation and compact support on the
interval $[-1,1]$, and that $r^0$ was continuous at $x$, it was shown that
$\hat r_n(x)$ was strongly consistent.
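
To make the role of the nearest-neighbor distance concrete, the following
Python sketch computes a variable-bandwidth estimate of this type: the
bandwidth at $x$ is twice the distance from $x$ to the $k$th nearest
uncensored observation. The uniform kernel on $[-1,1]$ and the names
`uniform_kernel` and `knn_kernel_hazard` are assumptions made for
illustration only.

```python
import numpy as np


def uniform_kernel(t):
    """Uniform kernel on [-1, 1]; bounded variation and compact support."""
    t = np.asarray(t, dtype=float)
    return np.where(np.abs(t) <= 1.0, 0.5, 0.0)


def knn_kernel_hazard(x, times, deltas, k, kernel=uniform_kernel):
    """Kernel hazard estimate at x with nearest-neighbor bandwidth 2 * R_k."""
    order = np.argsort(times, kind="stable")
    z = np.asarray(times, dtype=float)[order]
    d = np.asarray(deltas, dtype=int)[order]
    n = len(z)
    uncensored = z[d == 1]
    if k < 1 or k > len(uncensored):
        raise ValueError("k must be between 1 and the number of uncensored points")
    # R_k: distance from x to the k-th nearest uncensored observation.
    r_k = np.sort(np.abs(uncensored - x))[k - 1]
    if r_k == 0.0:
        raise ValueError("zero bandwidth: x coincides with an uncensored point")
    bandwidth = 2.0 * r_k
    j = np.arange(1, n + 1)
    weights = d / (n - j + 1.0)
    return float(np.sum(weights * kernel((x - z) / bandwidth)) / bandwidth)
```

Where uncensored observations are dense, $R_k$ is small and the estimate is
smoothed only locally; where they are sparse, the window widens automatically.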

In a recent paper Schafer (1985) considered a modified kernel-type
estimator where the bandwidths were stochastic but did not depend on the
point of interest $x$ (as Tanner's 1983 estimator did). More specifically,
Schafer (1985) suggested the estimator

$$\hat r_n(x) = \sum_{j=1}^{n} (n-j+1)^{-1} \Delta'_j\, h_{n,j}^{-1}\, K\!\left(\frac{x-Z_j}{h_{n,j}}\right),$$

where

$$h_{n,j} = \inf\{\, r : H_n(Z_j + r) - H_n(Z_j - r) \ge p_n \,\},$$

$p_n$ is a sequence of positive numbers for which $p_n \to 0$, and $H_n$ is
the empirical cumulative hazard function introduced by Nelson (1969). Schafer
commented that the bandwidths suggested by Tanner (1983) had the disadvantage
of being "biased" by the censoring distribution in the sense that they
adapted to the conditional density of the random variables $X_i^0$ under the
condition $X_i^0 \le U_i$ of being uncensored, rather than to the hazard
function $r^0$ to be estimated. Using the bandwidths suggested by Schafer
(1985), the uniform strong consistency of his estimator could be proven. More
precisely, his result was the following.

Theorem. Choose $p_n$ such that $p_n \log(n)/n \to \infty$. Let $K(\cdot)$ be
a bounded Riemann integrable function with compact support on the interval
$[-1,1]$. Let the density functions of both the $X_i^0$ and the $U_i$ be
continuous everywhere, and let $0 < b_1 < b_2$ with $P(U_1 < b_2) < 1$. Then
$\sup_{x \in [b_1,b_2]} |\hat r_n(x) - r^0(x)| \to 0$ with probability one.

5.  KERNEL  ESTIMATION  UNDER  THE  PHM 

In this section we will discuss the results of McNichols and Padgett
(1981). Here the Koziol-Green model or proportional hazards model (PHM)
defined in Section 2 will be assumed. Recall that this assumption is equivalent
to assuming that $X = (X_1,\dots,X_n)$ and $\Delta = (\Delta_1,\dots,\Delta_n)$
are independent.

Although they are somewhat restrictive in practice, proportional hazards
models have been used frequently in censored data problems to derive procedures
and to study their theoretical properties. For the two sample test problem,
Efron (1967) computed the Pitman efficiency of three competing test statistics
under PHM. Koziol and Green (1976) derived the Cramer-von Mises statistic
corresponding to the Kaplan-Meier product-limit estimate. Csörgő and Horváth
(1981) constructed empirical exact confidence bands for the life distribution
of the censored variable and also gave rates of convergence for a wide class
of functionals including the Cramer-von Mises functional. Chen, Hollander,
and Langberg (1982) obtained small sample properties of the Kaplan-Meier
estimator (in particular, the $\alpha$th moment), while Cox (1972) derived
procedures for regression models under a proportional hazards assumption. Such
models also arise naturally due to the type of system under study in some
reliability problems.

Throughout this section let $\tilde r_n(x)$ denote the hazard function
estimator given by Watson and Leadbetter (1964a,b) for the uncensored case;
that is, considering the $Z_j$'s as order statistics of a complete random
sample from $F$,

$$\tilde r_n(x) = h^{-1} \sum_{j=1}^{n} (n-j+1)^{-1} K\!\left(\frac{x-Z_j}{h}\right), \qquad x \ge 0.$$


Let $K(t)$ be a suitable kernel and $h = h_n$ be a positive sequence. Based
on the arbitrarily right-censored sample $(X_i,\Delta_i)$, $i=1,\dots,n$,
McNichols and Padgett (1981) considered

$$\hat r_n(x) = \left[ \int h^{-1} K\!\left(\frac{x-t}{h}\right) \frac{d\hat F_n(t)}{1-\hat F_n(t)} \right] I_{[0,\infty)}(x)$$

as an estimator of the hazard function $r^0(x)$. This was understood to be

$$\hat r_n(x) = h^{-1} \sum_{j=1}^{n} v_j\, K\!\left(\frac{x-Z_j}{h}\right) I_{[0,\infty)}(x), \tag{5.1}$$

where $I_A(\cdot)$ is the indicator function of the set $A$,

$$v_j = \frac{\hat P_n(Z_j) - \hat P_n(Z_{j+1})}{\hat P_n(Z_j)} = \frac{\Delta'_j}{n-j+1}, \qquad j=1,\dots,n-1,$$

and $v_n = 1$.

The limiting behavior of (5.1) was considered under Parzen's (1962)
conditions on the kernel function $K(t)$ and the sequence $\{h = h_n\}$ and
Watson and Leadbetter's (1964a,b) conditions on $K$ and $F$. These results
are given in the next two theorems.

Theorem 5.1. Let $K(t)$ be a Borel function satisfying the conditions

(i)   $\sup_{-\infty < t < \infty} |K(t)| < \infty$,

(ii)  $\int_{-\infty}^{\infty} |K(t)|\,dt < \infty$,

(iii) $\lim_{t \to \infty} |t\,K(t)| = 0$,

and

(iv)  $\int_{-\infty}^{\infty} K(t)\,dt = 1$;

then

$$\lim_{n \to \infty} E[\hat r_n(x)] = \alpha\, r(x) = r^0(x), \qquad x \ge 0.$$


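Outline of Proof: Note that $\hat r_n(x)$ is a function of
$Z = (Z_1,\dots,Z_n)$ and $\Delta' = (\Delta'_1,\dots,\Delta'_{n-1})$. Also,
since $\Delta$ and $X$ are independent under PHM, $Z$ and $\Delta'$ are
independent. Thus,

$$E[\hat r_n(x)] = E_{Z,\Delta'}[\hat r_n(x)]
= h^{-1} \sum_{j=1}^{n} E_{\Delta'}(v_j)\, E_{Z_j}\!\left[K\!\left(\frac{x-Z_j}{h}\right)\right]$$

$$= h^{-1} \sum_{j=1}^{n-1} E_{\Delta'}(v_j)\, E_{Z_j}\!\left[K\!\left(\frac{x-Z_j}{h}\right)\right]
+ h^{-1} E_{Z_n}\!\left[K\!\left(\frac{x-Z_n}{h}\right)\right]. \tag{5.2}$$

But $E_{\Delta'}(v_j) = (n-j+1)^{-1} P[\Delta'_j = 1] + 0 \cdot P[\Delta'_j = 0] = \alpha (n-j+1)^{-1}$
for $j=1,\dots,n-1$, so that (5.2) becomes

$$\alpha\, E[\tilde r_n(x)] + (1-\alpha)\, h^{-1} E_{Z_n}\!\left[K\!\left(\frac{x-Z_n}{h}\right)\right] \to \alpha\, r(x)$$

as $n \to \infty$ by Watson and Leadbetter's (1964a,b) results and the fact
that

$$h^{-1} E_{Z_n}\!\left[K\!\left(\frac{x-Z_n}{h}\right)\right] \to 0.$$

By the PHM,

$$1-F(x) = [1-F^0(x)][1-H(x)] = [1-F^0(x)]^{1+\beta}.$$

Therefore,

$$f(x) = (1+\beta)\, f^0(x)\, [1-F^0(x)]^{\beta},$$

$$\alpha\, r(x) = \alpha\, \frac{f(x)}{1-F(x)} = \alpha\, \frac{(1+\beta) f^0(x) [1-F^0(x)]^{\beta}}{[1-F^0(x)]^{1+\beta}} = r^0(x),$$

since $\alpha = P[X_i^0 \le U_i] = (1+\beta)^{-1}$ (Csörgő and Horváth, 1981).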
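
The value of the uncensoring probability used in the last step can be checked
directly. The short derivation below is a sketch that assumes only what is
stated in Section 2: the lifetimes and censoring variables are independent,
$F^0$ is absolutely continuous, and the Koziol-Green relation between $H$ and
$F^0$ holds with exponent $\beta$ (written here with $\alpha$ for the
probability of an uncensored observation).

```latex
\begin{align*}
\alpha = P[X_i^0 \le U_i]
  &= \int_0^\infty \bigl[1 - H(x)\bigr]\, dF^0(x) \\
  &= \int_0^\infty \bigl[1 - F^0(x)\bigr]^{\beta} f^0(x)\, dx \\
  &= \int_0^1 u^{\beta}\, du
     \qquad \bigl(u = 1 - F^0(x)\bigr) \\
  &= \frac{1}{1+\beta}.
\end{align*}
```

For instance, $\beta = 1$ (censoring and lifetime distributions equal) gives
an uncensoring probability of one half, while small $\beta$ corresponds to
light censoring.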


Note that

$$\hat r_n(x) = \hat r_n^{TW}(x) + (1-\Delta'_n)\, h^{-1} K\!\left(\frac{x-Z_n}{h}\right), \tag{5.3}$$

where $\hat r_n^{TW}(x)$ denotes the estimator of Tanner and Wong (1983)
given in Section 4. Thus $\hat r_n(x) = \hat r_n^{TW}(x)$ whenever
$\Delta'_n = 1$, that is, whenever the largest observation is uncensored.
Also, (5.3) and the fact that $\alpha\, r(x) = r^0(x)$ under the PHM assumption
account for the somewhat different expressions (for the mean and variance)
obtained by McNichols and Padgett (1981) and Tanner and Wong (1983). The
conditions on $K$ assumed by Tanner and Wong were essentially Parzen's (1962)
conditions. Also, under the PHM,

(i)  $r(x)$ is continuous implies that $r^0(x)$ is continuous

and

(ii) $K$ is compatible with $F$ implies $K$ is compatible with $F^0$.

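The following result was proven for the second moment of $\hat r_n(x)$.

Theorem 5.2. Let $K(t)$ satisfy the conditions of Theorem 5.1. Then
for $x \ge 0$,

$$E[\hat r_n^2(x)] = \alpha^2 E[\tilde r_n^2(x)] + (\alpha-\alpha^2)\, p_{1n} + (1-\alpha)\, E_n + p_{3n},$$

where

$$p_{1n} = h^{-2} \sum_{j=1}^{n} \binom{n}{j-1} (n-j+1)^{-1} \int K^2\!\left(\frac{x-t}{h}\right) f(t)\, [1-F(t)]^{n-j} F(t)^{j-1}\, dt,$$

$$E_n = h^{-2} \int K^2\!\left(\frac{x-t}{h}\right) f(t)\, n\, F(t)^{n-1}\, dt = h^{-2} E_{Z_n}\!\left[K^2\!\left(\frac{x-Z_n}{h}\right)\right],$$

and

$$p_{3n} = 2(\alpha-\alpha^2)\, h^{-2} \iint_{t_1 < t_2} \sum_{i=1}^{n-1} \frac{n!}{(n-i+1)\,(i-1)!\,(n-i-1)!}\, F(t_1)^{i-1} [F(t_2)-F(t_1)]^{n-i-1} K_{f,n}(t_1,x)\, K_{f,n}(t_2,x)\, dt_1\, dt_2,$$

with $K_{f,n}(t,x) = f(t)\,K((x-t)/h)$. In addition,

$$\lim_{n \to \infty} E[\hat r_n^2(x)] = \left[\lim_{n \to \infty} E\,\hat r_n(x)\right]^2 = [r^0(x)]^2, \qquad x > 0,$$

if $nh \to \infty$ and $n \cdot \sup_{|y| > \varepsilon h_n^{-1}} |y\,K(y)| \to 0$
as $n \to \infty$ for $\varepsilon > 0$ fixed.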

Corollary 5.3. Under the conditions of Theorem 5.2, for $x > 0$,

$$\lim_{n \to \infty} E[\hat r_n(x) - r^0(x)]^2 = 0.$$

Note that the last condition on $K$ in Theorem 5.2 is satisfied for
kernels with finite support and also for the normal kernel for choices of
$h_n$ such as a negative power of $n$. It should be noted also that
$nh_n \to \infty$ is Parzen's (1962) condition. In addition, Theorem 5.2
implies that $\mathrm{Var}[\hat r_n(x)] \to 0$ as $n \to \infty$ for each
$x > 0$.


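Outline of Proof of Theorem 5.2: Squaring $\hat r_n(x)$ from (5.1) and taking
the expectation,

$$E[\hat r_n^2(x)] = h^{-2} \sum_{j=1}^{n} E_{\Delta'}(v_j^2)\, E_{Z_j}\!\left[K^2\!\left(\frac{x-Z_j}{h}\right)\right]
+ 2h^{-2} \mathop{\sum\sum}_{i<j} E_{\Delta'}(v_i v_j)\, E_{Z_i,Z_j}\!\left[K\!\left(\frac{x-Z_i}{h}\right) K\!\left(\frac{x-Z_j}{h}\right)\right]. \tag{5.4}$$

Now, the first term in (5.4) can be written as

$$\alpha\, h^{-2} \sum_{j=1}^{n} (n-j+1)^{-2} E_{Z_j}\!\left[K^2\!\left(\frac{x-Z_j}{h}\right)\right]
+ (1-\alpha)\, h^{-2} E_{Z_n}\!\left[K^2\!\left(\frac{x-Z_n}{h}\right)\right]$$

$$= \alpha\, p_{1n} + (1-\alpha)\, E_n
= \alpha^2 p_{1n} + (\alpha-\alpha^2)\, p_{1n} + (1-\alpha)\, E_n. \tag{5.5}$$

Next, for $i < j$, $j = 2,\dots,n-1$,

$$E_{\Delta'}(v_i v_j) = E_{\Delta'_i}(v_i)\, E_{\Delta'_j}(v_j) = \alpha^2 (n-i+1)^{-1} (n-j+1)^{-1},$$

since $v_j$ is a function only of $\Delta'_j$, and $\Delta'_i$ and
$\Delta'_j$ are independent. Also,
$E_{\Delta'}(v_i v_n) = E_{\Delta'_i}(v_i) = \alpha (n-i+1)^{-1}$. Thus, the
second term in (5.4) becomes

$$2h^{-2} \mathop{\sum\sum}_{i<j \le n-1} E_{\Delta'}(v_i v_j)\, E_{Z_i,Z_j}\!\left[K\!\left(\frac{x-Z_i}{h}\right) K\!\left(\frac{x-Z_j}{h}\right)\right]
+ 2h^{-2} \sum_{i=1}^{n-1} E_{\Delta'_i}(v_i)\, E_{Z_i,Z_n}\!\left[K\!\left(\frac{x-Z_i}{h}\right) K\!\left(\frac{x-Z_n}{h}\right)\right]$$

$$= 2\alpha^2 h^{-2} \mathop{\sum\sum}_{i<j \le n-1} (n-i+1)^{-1} (n-j+1)^{-1} E_{Z_i,Z_j}\!\left[K\!\left(\frac{x-Z_i}{h}\right) K\!\left(\frac{x-Z_j}{h}\right)\right]
+ 2h^{-2} \sum_{i=1}^{n-1} \alpha\, (n-i+1)^{-1} E_{Z_i,Z_n}\!\left[K\!\left(\frac{x-Z_i}{h}\right) K\!\left(\frac{x-Z_n}{h}\right)\right]$$

$$= 2\alpha^2 p_{2n} + p_{3n},$$

where

$$p_{2n} = h^{-2} \mathop{\sum\sum}_{i<j} (n-i+1)^{-1} (n-j+1)^{-1} E_{Z_i,Z_j}\!\left[K\!\left(\frac{x-Z_i}{h}\right) K\!\left(\frac{x-Z_j}{h}\right)\right]$$

and

$$p_{3n} = 2(\alpha-\alpha^2)\, h^{-2} \sum_{i=1}^{n-1} (n-i+1)^{-1} E_{Z_i,Z_n}\!\left[K\!\left(\frac{x-Z_i}{h}\right) K\!\left(\frac{x-Z_n}{h}\right)\right]. \tag{5.6}$$

Thus, from (5.4) - (5.6) we have

$$E[\hat r_n^2(x)] = \alpha^2 E[\tilde r_n^2(x)] + (\alpha-\alpha^2)\, p_{1n} + (1-\alpha)\, E_n + p_{3n} \tag{5.7}$$

since $p_{1n} + 2p_{2n}$ is simply $E[\tilde r_n^2(x)]$. This then implies that

$$\lim_{n \to \infty} E[\hat r_n^2(x)] = \alpha^2 r^2(x) = [r^0(x)]^2$$

since it can be shown that the $p_{1n}$'s, $E_n$'s and the $p_{3n}$'s converge
to zero as $n \to \infty$. Thus, under the PHM

$$E[\hat r_n^2(x)] \to [r^0(x)]^2. \qquad ///$$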

Note that Tanner and Wong (1983) required that $K$ be compatible with
both $F^0$ and $H$ in order to derive their results for
$\mathrm{Var}[\hat r_n(x)]$. The compatibility condition of $K$ with $H$ is
different from the supremum condition on $K$ in Theorem 5.2.

6.  CONCLUSION 


Numerous methods for estimating the hazard rate of a lifetime
distribution based on right-censored data are now available in the literature.
These results are all fairly recent, and their number appears to be growing
rapidly. As the results in Sections 4 and 5 indicate, the kernel method has
been most widely studied and perhaps provides the smoothest nonparametric
estimates of the hazard rate. Kernel estimates are simple to compute for a
given bandwidth $h_n$. An important problem which should be studied with
respect to these estimators is how to optimally select the bandwidth for a
given right-censored sample.